PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG016582t2
Common NameTCM_016582
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HD-ZIP
Protein Properties Length: 686aa    MW: 75924.6 Da    PI: 5.3147
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG016582t2genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox31.43.3e-102263256
                      HHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox 32 eLAkklgLterqVkvWFqNrRakek 56
                      +L+++lgL+ rqVk+WFqNrR+++k
  Thecc1EG016582t2  2 KLSQELGLKPRQVKFWFQNRRTQMK 26
                      79********************998 PP

2START150.71.3e-471683952206
                       HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS...............SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S... CS
             START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskv..............dsgealrasgvvdmvlallveellddkeqWdetla... 77 
                       la + ++elvk+   +ep+W +    eng e l  +e ++               +++ea r+s+vv+m++++lv  +ld + +W+e ++   
  Thecc1EG016582t2 168 LAMSSMDELVKMCRTNEPLWIRNN--ENGRELLNLEEHARMfpwapsnlkqrsteFRTEAGRDSAVVIMNSVTLVDAFLDAN-KWTELFPsiv 257
                       677899****************99..999999999988887999999***********************************.********** PP

                       .EEEEEEEECTT.....EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE....TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTC CS
             START  78 .kaetlevissg.....galqlmvaelqalsplvp.RdfvfvRyirq...lgagdwvivdvSvdseqkppesssvvRaellpSgiliepksng 160
                        +a+t++v+s g     g lqlm+aelq+lsplvp R+ +f+Ry++q   + +  w+ivd  +d   ++  ++s+   +++pSg+li++++ng
  Thecc1EG016582t2 258 aRAKTVQVVSAGvsgtnGSLQLMYAELQVLSPLVPtREAYFLRYCQQqnlDDETYWAIVDFPIDGFHNNL-QASFPLYRRRPSGCLIQDMPNG 349
                       ***********************************************99888889*********999998.67666666************** PP

                       EEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
             START 161 hskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                       +s+vtwveh++ +++ +h+++ ++v +g+a+ga +w+a l+rqce+
  Thecc1EG016582t2 350 YSRVTWVEHAEIEEKPVHQIFSHFVYNGMAFGAHRWLAVLERQCER 395
                       ********************************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5007112.244128IPR001356Homeobox domain
SuperFamilySSF466896.42E-8229IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.602.0E-9233IPR009057Homeodomain-like
CDDcd000861.00E-7229No hitNo description
PfamPF000461.4E-7226IPR001356Homeobox domain
PROSITE patternPS000270326IPR017970Homeobox, conserved site
PROSITE profilePS5084842.809158398IPR002913START domain
SuperFamilySSF559617.56E-29159397No hitNo description
CDDcd088751.87E-115162394No hitNo description
SMARTSM002349.9E-32167395IPR002913START domain
PfamPF018525.7E-40168395IPR002913START domain
SuperFamilySSF559612.89E-16413652No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0048497Biological Processmaintenance of floral organ identity
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 686 aa     Download sequence    Send to blast
MKLSQELGLK PRQVKFWFQN RRTQMKAQQD RSDNVILRAE NESLKNEFYR LQAELSKLVC  60
PNCGGPAVPG GISFEELRIE NARLREELER VCAIASRYIG RPIQTMGAAP ALMPPSLDLD  120
MNMYPRHFTE PMASCTEMMP VPMLPETASF PENNLVLVEE EKTVAMELAM SSMDELVKMC  180
RTNEPLWIRN NENGRELLNL EEHARMFPWA PSNLKQRSTE FRTEAGRDSA VVIMNSVTLV  240
DAFLDANKWT ELFPSIVARA KTVQVVSAGV SGTNGSLQLM YAELQVLSPL VPTREAYFLR  300
YCQQQNLDDE TYWAIVDFPI DGFHNNLQAS FPLYRRRPSG CLIQDMPNGY SRVTWVEHAE  360
IEEKPVHQIF SHFVYNGMAF GAHRWLAVLE RQCERVASLM ARNITDLGVI PSPEARKNLM  420
RLAQRMIRTF CVNISTSSGQ LWTALPDSAD DTVRITTRKV TEAGQPNGLI LCAVSTTWLP  480
YPHDQVFDLL RDERSRSQLE VLSNGNALHE VAHIANGAHP GNCISLLRIN VASNSSQHVE  540
LMLQESCTDR SGSLVVYSTV DVDSVQLAMS GEDPSCIPLL PLGFFITPVE LIRDASDDQG  600
KSVPPSEEAN GHISGSLLTV GLQVLASTVP SAKINLSSIA AINNHLCTTV HQITAALSSS  660
TAPSCPDNGI GVLGSCTEPA SAPEK*
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007040684.10.0Homeobox-leucine zipper protein HDG5 isoform 1
RefseqXP_007040685.10.0Homeobox-leucine zipper protein HDG5 isoform 2
SwissprotQ9FJS20.0HDG5_ARATH; Homeobox-leucine zipper protein HDG5
TrEMBLA0A061G5U70.0A0A061G5U7_THECC; Homeobox-leucine zipper protein HDG5 isoform 1
TrEMBLA0A061G6L90.0A0A061G6L9_THECC; Homeobox-leucine zipper protein HDG5 isoform 2
STRINGPOPTR_0003s09470.10.0(Populus trichocarpa)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G46880.10.0homeobox-7
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]